Clustered segmentations
نویسندگان
چکیده
The problem of sequence and time-series segmentation has been discussed widely and it has been applied successfully in a variety of areas, including computational genomics, data analysis for scientific applications, and telecommunications. In many of these areas the sequences involved are multidimensional, and the goal of the segmentation is to discover sequence segments with small variability. One of the characteristics of existing techniques is that they force all dimensions to share the same segment boundaries, yet, it is often reasonable to assume that different dimensions are more correlated than others, and that concrete and meaningful states are associated only with a subset of dimensions. In this paper we study the problem of segmenting a multidimensional sequence when the dimensions of the sequence are allowed to form clusters and be segmented separately within each cluster. We demonstrate the relevance of this problem to many data-mining applications. We discuss the connection of our setting with existing work, we show the hardness of the suggested problem, and we propose a number of algorithms for its solution. Finally, we give empirical evidence showing that our algorithms work well in practice and produce useful results.
منابع مشابه
Problems and Algorithms for Sequence Segmentations
The analysis of sequential data is required in many diverse areas such as telecommunications, stock market analysis, and bioinformatics. A basic problem related to the analysis of sequential data is the sequence segmentation problem. A sequence segmentation is a partition of the sequence into a number of non-overlapping segments that cover all data points, such that each segment is as homogeneo...
متن کاملObject tracking with spatio-temporal blob
We propose to develop a tracking algorithm of objects or humans, based on kinematics, with a fixed monochromatic camera, without any knowledge on the sequence: size, shape or number of objects are unknown and can evolve with time. For this purpose, we first make a motion detection, then, as we suppose that people move locally in a consistent way and thus draw a regular trajectory in the spatio-...
متن کاملEvaluating Text Segmentation using Boundary Edit Distance
This work proposes a new segmentation evaluation metric, named boundary similarity (B), an inter-coder agreement coefficient adaptation, and a confusion-matrix for segmentation that are all based upon an adaptation of the boundary edit distance in Fournier and Inkpen (2012). Existing segmentation metrics such as Pk, WindowDiff, and Segmentation Similarity (S) are all able to award partial credi...
متن کاملShape-Based Averaging for Combination of Multiple Segmentations
Combination of multiple segmentations has recently been introduced as an effective method to obtain segmentations that are more accurate than any of the individual input segmentations. This paper introduces a new way to combine multiple segmentations using a novel shape-based averaging method. Individual segmentations are combined based on the signed Euclidean distance maps of the labels in eac...
متن کاملImage segmentation based on merging of sub-optimal segmentations
In this paper a heuristic segmentation algorithm is presented based on the oversegmentation of an image. The method uses a set of different segmentations of the image produced previously by standard techniques. These segmentations are combined to create the oversegmented image. They can be performed using different techniques or even the same technique with different initial conditions. Based o...
متن کاملMesial temporal sclerosis and temporal lobe epilepsy: MR imaging deformation-based segmentation of the hippocampus in five patients.
In five patients with mesial temporal sclerosis, the authors verified the precision and reproducibility of hippocampal segmentations with deformation-based magnetic resonance (MR) imaging. The overall percentage overlap between automated segmentations was 92.8% (SD, 3.5%), between manual segmentations was 73.1% (SD, 9.5%), and between automated and manual segmentations was 74.8% (SD, 10.3%). De...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004